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Abstract. A sorting network is a shortest path from 12 • • • n to 
n ■ ■ - 21 in the Cay ley graph of the symmetric group Sn gener- 
ated by nearest-neighbor swaps. A pattern is a sequence of swaps 
that forms an initial segment of some sorting network. We prove 
that in a uniformly random n-element sorting network, any fixed 
pattern occurs in at least cn^ disjoint space-time locations, with 
probability tending to 1 exponentially fast as n — >■ oo. Here c is 
a positive constant which depends on the choice of pattern. As 
a consequence, the probability that the uniformly random sorting 
network is geometrically realizable tends to 0. 

1. Introduction 

Let Sn be the group of all permutations a = (cr(l), . . . , cr(n)) of 
{1, . . . ,n} with composition given by ((Tr)(i) = (T(r(i)). We denote by 
o"j the adjacent transposition or swap (j j+1) = (!,... . . . ,n). 

A sorting network of size n is a sequence (si, S2, ■ ■ ■ , sn) of N := (2) 
integers with < < n, such that the composition as^<Js2 ' ' ' (^s^ 
equals the reverse permutation (n, — 1, . . . , 1). We sometimes say 
that at time k a swap occurs at position Sk, and we illustrate a sorting 
network by a set of crosses with coordinates {k, Sk) for k = 1, . . . ,N. 
(This is natural, since the crosses may be joined by horizontal lines to 
give a "wiring diagram" consisting of n polygonal lines whose order is 
reversed as we move from left to right; see Figure [H) 

Interest in sorting networks was initiated by Stanley, who proved 
in ^ that the number of sorting networks of size n is equal to the 
number of standard staircase-shape Young tableaux of size n, i.e. those 
with shape (n — 1, n — 2, . . . , 1). Uniformly random sorting networks 
were introduced and studied by Angel, Holroyd, Romik, and Virag in 
[AHRV] . giving rise to many striking results and conjectures. 
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Figure 1. Left: the sorting network (1, 3, 2, 1, 3, 2) of size 4, illus- 
trated by crosses corresponding to its swaps. Right: the associated 
wiring diagram. 

A pattern is any finite sequence of positive integers that is an initial 
segment of some sorting network. Thus for example, (1, 2, 1) and (4, 2) 
are patterns, but (1,1) and (1,2,1,2) are not. The size of a pattern 
is the minimum size of a sorting network that contains it as an ini- 
tial segment, which is also one more than the maximal element in the 
pattern. 

Let CO = {si, . . . , sn) be a sorting network of size n and let 7 be 
a pattern. Let C [1, A^] and [a,b] C [l,n — 1], and consider the 
subsequence ti, . . . ,ti oi Si, . . . , Sj consisting of precisely those elements 
lying in the interval [a,b]. We say that the pattern 7 occurs at time 
interval and position [a,b] (or simply at x [a,b]) if 7 = (ti — 
a + 1, . . . ,tk — a + 1), and no k E has G {a — 1, 6 + 1}. In other 
words, the swaps in the space-time window x [a,b] are precisely 
those of 7, after an appropriate shift in location, and there are no swaps 
at the two adjacent positions in this time interval. See Figure [2] for an 
example. 

We say that a pattern 7 occurs R times in a sorting network u 
if R is the maximum integer for which there exist pairwise disjoint 
rectangles {[v, jr] x [o-r, ^r]}.f=i such that 7 occurs at each. See FigureO 

Theorem 1. Fix any pattern 7 of size k. There exist constants Ci, C2 > 

(depending on '-/) such that for every n > k, the pattern 7 occurs at 
least cin? times in a uniformly random sorting network of size n, with 
probability at least 1 — e~'^^". 

We conjecture that the probability in Theorem [1] is in fact at least 

1 — e"^"^ for some c = 0(7). 

We will prove Theorem [1] by establishing a closely related result 
about uniformly random standard staircase-shape Young tableaux, and 
using a bijection due to Edelman and Greene pGj between sorting 
networks and Young tableaux. 



A PATTERN THEOREM FOR RANDOM SORTING NETWORKS 



4 

3 
2 
1 




123456789 10 

Figure 2. The pattern (2,1,2) occurs in the sorting network 
(1,3,2,4,1,3,4,2,1,3) at time interval = [4,7] and position 
[a, b] = [3, 4]. Note the requirement that the shaded regions contain 
no swaps. 
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Figure 3. Pattern (1,2) occurs 3 times in the sorting network 
(4,2,3,1,4,2,1,3,4,2). 



Write N= {1,2,...}. A Young diagram A is a set of the form 
e : 1 < j < Aj}, where Ai > A2 > A3 > ■ ■ ■ > are integers 
and Yl'i^i ~'- \M < The numbers Aj are the row lengths of A. 
In what follows we denote by (Ai, A2, . . . ) the Young diagram with row 
lengths Ai > A2 > . . . . We call an element x = G A a box, and 
draw it as a unit square at location (^, j) (with the traditional conven- 
tion that (1, 1) is at the top left and the first coordinate is vertical). A 
tableau T of shape A is a map from A to the integers whose values are 
non- decreasing along rows and columns. We call T{x) the entry as- 
signed to box X. A standard Young tableau is a tableau T of shape 
A such that the set of entries of T is {1, 2 . . . , |A|}. We are mostly 
interested in standard staircase-shape Young tableaux of size n, 
i.e. those with shape staircase Young diagram (n — 1, n — 2, . . . , 1). 
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Figure 4. A standard Young tableau T of shape A = (5, 5, 4, 3, 1), 
the subdiagram A^^'^\ the subtableau T*^^'^\ and a standard Young 
tableau identically ordered with T^^'^^ 



For {k,i) e we write (i, j) < {k,i) ii i < k and j < i. For 

a Young diagram A and a box G A, we define the subdiagram 
A(*'^) with top-left corner by A^*'^) := {{k,i) G A : > 
clearly A*^*'-'^ is mapped to a Young diagram by the translation {k, i) i— 
{k — i + 1, i — j + 1). If T is a tableau of shape A then we define the 
subtableau T^''-'-' to be the restriction of T to A*-*'-'-', and we call A 
the support of T^^'^\ 

We say that two tableaux S and T of the same shape A are iden- 
tically ordered if for all x,y & Xwe have S{x) < S{y) if and only if 
T[x) < T{y). Furthermore, if 5* and T are tableaux or subtableaux, 
and there is a translation 6 that maps (bijectively) the support of S to 
the support of T, then we say that 5* and T are identically ordered 
if for all X, y in the support of 5* we have S{x) < S{y) if and only if 
T{9{x)) < T{6{y)). Figure H] illustrates the above notations. 

Theorem [T] will be deduced from the following. 

Theorem 2. Let T he any standard staircase- shape Young tableau of 
size k. For some positive constants d^, and c'^ (depending only on 
k), with probability at least 1 — e~^^'^ , a uniformly random standard 
staircase- shape Young tableau of size n > k contains at least c'^n sub- 
tableaux with pairwise disjoint supports such that: 

(1) each is identically ordered with T; 

(2) all their entries are greater than N — C2n. 

As an application of Theorem [1] we prove that a uniformly random 
sorting network is not geometrically realizable in the following sense. 
Consider a set X of n points in such that no two points from X 
lie on the same vertical line, no three points are collinear, and no two 
pairs of points define parallel lines. Label the points l,...,n from 
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left to right (i.e. in order of their first coordinate). Let be the set 
obtained by rotating M? by angle about the origin, and let a^p be the 
permutation found by reading the labels in from left to right. As 
increases from to vr, the permutation changes via a sequence of 
swaps, which form a sorting network. Any sorting network that can 
be generated in this way is called geometrically realizable. (Such 
networks were called stretchable in [AHRVj . but this term is used with 
a different meaning in [GIi\ IGPj ) . 

Goodman and Pollack |GPj gave an example of a sorting network 
of size 5 that is not geometrically realizable. On the other hand, in 
[AHRVj . it was conjectured (on the basis of strong experimental and 
heuristic evidence) that a uniformly random sorting network is with 
high probability approximately geometrically realizable, in the sense 
that its distance to some random geometrically realizable network tends 
to zero in probability (in a certain natural metric). The conjectures 
of [AHRVj would also imply that, for fixed m, the sorting network 
obtained by observing only m randomly chosen particles from a uni- 
formly random sorting network of size n > m is with high probability 
geometrically realizable as n ^ oo. (The conjectures also imply that 
these size-m networks have a limiting distribution as n — t- oo, as well 
as providing a precise description of the limit. Certain aspects of the 
latter prediction were verified rigorously in |AHj . ) However, we prove 
that with high probability a uniformly random sorting network is not 
itself geometrically realizable. 

Theorem 3. The probability that a uniformly random sorting network 
of size n is geometrically realizable tends to zero as n tends to infinity. 

While our proof yields an exponential (in n) bound on the probability 
that a uniform sorting network of size n is geometrically realizable, we 
believe the probability is even 0{e~'^^ ). 

The paper is organized as follows. In Section 2 we recall basic def- 
initions and the Edelman-Greene bijection between sorting networks 
and standard Young tableaux. In Sections 3 and 4 we prove some 
auxiliary lemmas about Young tableaux and sequences of random vari- 
ables, respectively. In Section 5 we prove Theorem [2] and then deduce 
Theorem [1] as a corollary. Finally, in Section 6 we prove Theorem [31 

2. Sorting networks and Young tableaux 

Edelman and Greene |EGj introduced a bijection between sorting 
networks of size n and standard staircase-shape Young tableaux of size 
n, i.e. of shape (n — 1, n — 2, . . . , 1). We describe it in a slightly modified 
version that is more convenient for us. 
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Figure 5. A standard staircase-shape Young tableau, sliding 
paths (shaded) and the sequence of tableaux in the Edelman- 
Greene bijection. Here n = 4 and the corresponding sorting net- 
work is (2, 1, 3, 2, 3, 1). Vertical lines show the correspondence be- 
tween the positions of maximal entries in the tableaux and numbers 
Si, . . . , Sat of the sorting network. 

Given a standard staircase-shape Young tableaux T of size n, we 
construct a sequence of integers Si, . . . ,sn as follows. Set Ti = T and 
repeat the following for t = 1,2, . . . , N. 

(1) Let X = {n — j,j) be the location of the maximal entry in the 
tableau Tj. Set St = j. 

(2) Compute the sliding path, which is a sequence Xi,X2, ■ ■ ■ ,xi, 
such that Xi = X and for i = 1,2, . . . we define Xj+i to be the 
box among {xi — (1, 0), — (0, 1)} with larger entry in Tj, with 
the convention that Tj(x) = for every x outside the staircase 
Young diagram of size n. Let i be the minimal i such that 
Tt{x,) = 0. 

(3) Perform the sliding, i.e. define the tableau Tt+i as follows. Set 
Tt+i{xi) = Tt{xi+i) for i = 1, 1 and set Tt+i{y) = Tt{y) 
for all boxes y of the staircase Young diagram of size n not 
belonging to {xi, . . . , 

An example of this procedure is shown in Figure [51 Edelman and 
Greene |EG] proved that the resulting sequence of numbers is indeed 
a sorting network, and furthermore that the algorithm provides a bi- 
jection between standard staircase-shape Young tableaux and sorting 
networks. 

Now we fix n, consider the set of all sorting networks of this size and 
equip it with the uniform measure. The Edelman-Greene bijection 
maps this measure to the uniform measure on the set of all standard 
staircase-shape Young tableaux of size n. 

Given a standard Young tableau T of shape A with |A| = M we 
define a sequence of Young diagrams by 

X = {xe\: T{x) <M -i}. 
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Figure 6. Hook (clear) and co-hook (shaded) in a Young diagram. 

Thus A = A° D D ■ • • D A^^ = 0, and A* \ A*+^ consists of the single 
box T~^(|A| — i). If T is a uniformly random standard Young tableau of 
shape A, then conditional on A*, A*~^, . . . , A°, the restriction of T to A* 
is uniformly random. Thus the sequence of diagrams described above 
is a Markov chain. 

3. Some properties of Young tableaux 

In this section we present a fundamental result about Young dia- 
grams (the hook formula) and deduce some of its consequences. 

When drawing pictures of Young diagrams we adopt as usual the 
convention that the first coordinate i (the row index) increases down- 
wards while the second coordinate j (the column index) increases from 
left to right. Given a Young diagram A, its transposed diagram A' 
is obtained by reflecting A with respect to diagonal i = j. The column 
lengths of A are the row lengths of A'. 

For any box x = {i, j) of a Young diagram A, its arm is the collection 
of Aj — j boxes to its right: G A : j' > j}. The leg of x is the set 

E X : i' > i} of X'j — i boxes below it. The union of the box x, 
its arm and its leg is called the hook of x. The number of boxes in the 
hook is called the hook length and is denoted by h{x). The co-arm 
is the set G A : j' < j}; the co-leg is the set G X : i' < i}, 

and their union (which does not include x) is called the co-hook and 
denoted by C{x). See Figure [61 Finally, a corner of a Young diagram 
A is a box x G A such that h{x) = 1, or equivalently such that A \ {x} 
is also a Young diagram. 

The dimension dim(A) of a Young diagram A is defined as the num- 
ber of standard Young tableaux of shape A (thus named because it is 
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the dimension of the corresponding irreducible representations of the 
symmetric group). 

Lemma 4 (Hook formula; |FRT] ) . The dimension dim(A) satisfies 

lAI! 



dim(A) 



See e.g. [FRTj or [M| for a proof 



n h{x) ■ 



Corollary 5. Let T be a uniformly random standard Young tableau of 
shape X, and let x be a corner of X. The location r~^(|A|) of the largest 
entry is distributed as follows. 

F(T-H\X\) = x)=^^^^^^^^^^ = — TT ^^'^ 

(Note that h{z) > 1 for any box in the co-hook C{x), so the right side 
is finite.) 

Proof. This is immediate from Lemma HI □ 

Lemma 6. Fix i > 0. Let a Young diagram X be a subset of the 
staircase Young diagram of size n, and let x = {i,j) be a corner of 
X with i,j > n/3 and n — i — j < i. Let T be a uniformly random 
standard Young tableau of shape X. We have 

P(T(x) = |A|) > ^, 

where c is a constant depending only on I. 

There is nothing special about the bound | on i, j - the lemma and 
proof hold as long as z, j > en, though the constant in the resulting 
bound tends to as e — )■ 0. 

Proof of Lemma\^ The box {i — k,j) of the co-hook has hook length 
Xi_k - j + k + l<n-i-j + 2k + l<i + 2k + l. Similarly the box 
{i,j — k) has hook length at most £ + 2k + 1. It follows that 

' " " ' wLV'(*.^)-4iMa-)-i 



> — 



1 / TT i + 2k + lV 



n 



^2 V J-J- i + 2k 
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Figure 7. The co-hooks C(x) and C{y) (shaded) and the matched 
parts of the co-arms and co-legs (hatched in matching directions). 



(Here we used that the factors are all decreasing in /i, greater than 1, 
and that z,j > n/S.) It is now easy to estimate 

k<n/3 ^ ^k<n/3 ^ ^k<n/'i 

£ + 2ln/3j +2 

= TT-'-^ > cn 

£ + 2 

for some c = c{t). □ 

Lemma 7. Let T he a uniformly random standard Young tableau of 
shape \, let x and y he two corners of X and i = \\x — y\\oD- Then 

P(T-H|A|) = y) -^' + '^^''+'^- 

For our application all we need is a bound of the form C{i) on this 
ratio, though we note that the bound we get is close to optimal for a 
tableau of shape (n + 1, n, . . . , n) with i + 1 rows, for large n. 

Proof of Lemma To compare the expressions from Corollary [5] for x 
and y, let us introduce a partial matching between C{x) and C{y). We 
match boxes of the co-arm of x and the co-arm of y if they are in the 
same column. We match boxes of the co-leg of x and the co-leg of y if 
they are in the same row. All boxes inside the rectangle with opposite 
vertices x and y remain unmatched (see Figure [7]). 

Writing x = (ii, j'l) and y = {12,32) without loss of generality assume 
that ii < 12 and ji > j2- Clearly, if z G C(x) and z' G C{y) are a 
matched pair, then h{z') = h{z) ± s, where s = i2~ii +ji —32 and the 
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sign is plus if the box z belongs to the co-leg of x and minus otherwise. 
Let M(x), \J[x) be the matched and unmatched parts of the co- hooks 
and similarly for y. We have 

( h{z) 



where the choice of the sign ± depends on whether a box z belongs to 
the co-arm or the co-leg of x. 

Let us bound the right side of ([I]). First note that all the boxes 
in the co-arm of x and all the boxes in the co-leg of y are matched. 
The product over z G U{y) is at least 1. Next, there are at most i 
unmatched boxes of the co-arm of x and their hook lengths are distinct. 
Consequently 



hiz)-l - 11 m-l ^ 



Turning to the last product in ([T]), a matched pair of boxes from the 
co-arms contributes to ([I]) the factor 

h{z) 



h{z) - 1 



h(z) — s 
h{z) - 1- 

which is easily seen to be less than 1. 

Finally, every matched pair of boxes from the co-legs contributes to 
the factor 

h{z) 



hiz) - 1 / , s 



h{z) + s \ {h{z) - l){h{z) + s) 



h{z) - 1 + 

This is greater than 1 for any h{z). As z varies over a co-leg of x, the 
values of h{z) are distinct. Consequently, the contribution from the 
matched boxes from the co-legs is bounded from above by 



m 



n-^^ = lim ^— (s + 1) = s + 1 <2i + l. 
I m -|- s \ r-i^oo r -|- s 



m=2 



m — 1 + S 
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Multiplying all the aforementioned inequalities we get the required es- 
timate. □ 

4. Sequences of random variables 

Recall that a real-valued random variable Y stochastically domi- 
nates another real- valued random variable Z if and only if there exist 
a probability space fl and two random variables Y, Z defined on Q, 
such that Y = Y and Z = Z, and Y > Z almost surely. 

Lemma 8. Let Xi,...,Xn be random variables taking values in 
{!,..., m,oo} such that a.s. each a G [1,?^?,] appears exactly 
r times. Let Ai be events, and define the filtration Ti = 
a {Xi, . . . , Xi, Ai, . . . , Ai-i) . Assume F{Ai \ J^i) > p a.s. for some 
p > and all i. Let Ga be the event 

N 

Ga = f]{{X,y^a}UA,), 

i=l 

that is that Ai occurs whenever Xi = a. Then YlZ=i ^Ga stochastically 
dominates the binomial random variable Bin(m,p^). 

To clarify the lemma, it helps to think of having m counters initial- 
ized at 0. At each step, a counter is selected (or no counter, signified 
by Xi = oo), and that counter is advanced with probability at least p. 
The event Ga is that the ath counter is advanced every time it is se- 
lected. Then after every counter has been selected r times, the number 
of counters with the highest possible value r stochastically dominates a 
Bin(m,p''") random variable. Note that the order in which counters are 
selected may depend arbitrarily on the past selections and advances. 
While this lemma seems intuitively clear and perhaps even obvious, 
the precise assumptions on the dependencies among the events and 
variables make the proof slightly delicate. 

Proof of Lemma\^ First, we want to extend the probability space, and 
define events A'^ C Ai and a finer filtration in such a way that 
f>[A[ \J^'i)=p for all i. 

Let VL be our original probability space and let /x be our original prob- 
ability measure. For i = 1, 2, . . . , let be the set of all elementary 
events in the finite cr-algebra Ti that have non-zero probabilities (with 
respect to /i). For any E & £^ let f2f denote the probability space {0, 1} 
with probability measure /if such that /if (1) =p/¥{Ai \ E). Our new 
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probability space fl' is the product of fl and all flf: 

N 

In other words, an element of O' is a pair {u!,f), where co E fl and / 
is a function from |Ji^* to {0, 1} (here |J denotes set-theoretic disjoint 
union, so |Ji^' '■— i) : E e Si}). We equip Q' with the prob- 

ability measure /i' which is the direct product of /i and the measures 

N 

In what follows we do not distinguish between a random variable X{uj) 
defined on Q and the random variable X{u, /) := X{u) defined on Q! . 

In the same way wc identify any event A oiVl with A := {{u, /) G fi' : 
u E A} C Q'. In what follows all the probabilities are understood with 

respect to 

For any E e\_\-£^ let denote the random variable on Q' given by 

f{uJ)^f{E). 
Now for any E e C \J. £^ set 

Bf := {{CO J) en' \ cue E, f{E) ^l}^En{f^ = 1}. 
Denote 

Bi, - U Bf 

and let A[^ AiO B^iy 

Let us introduce a filtration on Q!: 

J-; = a(Xi,...,X,,A,...,A-i,{/''}), 

where E runs over all elements of Uj=i ■ 

Note that A- G Jj+i- We claim that P(A' | JJ) = p for every i. 
Indeed, this immediately follows from the definition of A[ and the fact 
that A[ is independent of all for E G Uj=i 

Moreover, consider any sequence of stopping times 1 < ri < • • • < 
n<N (w.r.t. the filtration T'). We claim that P = p^. The 
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proof is a simple induction in ^. For £ = 1 we have 



TV 



1=1 



TV TV 

= 5^P(ti = i)W>{K \n = i) = 5^P(ti = t)-p = P, 

i=l i=l 

where in the equahty (*) we used that | J-'l) = p and {ri = i} e 
J-'j'. Now assume that our statement is true ior £ — h — 1. Then for 

i = h we have 

^ if] ) = E ^(^1 = I ^1 = ^■)^ ( h < I 4 n {ri = j} ) ■ 

\j=l / j=l \i=2 J 

Note that for i > 2 the restriction of Tj on the set A'j fl {ti = j} again 
a stopping time. Indeed, by the definition, j < Ti < N on {ri = j}, 
and for k > j we have {tj < k} (1 A'j fl {ri = j} G J-"^, since both 
{Ti < k} & J^l and A^- e and {ri = j} G J-"^. Therefore, using the 
induction assumption we conclude that if P(74^- fl {ri = j}) > 0, then 

P(nt2 K I 4- n {^1 = j}) = P""^- Hence, 

'' h \ N TV 

^j=l / j=l j=l 
Now, let 

TV 

G'^ = f]{{X,^a}UA'^)CGa. 

1=1 

Applying the above claim to the r ordered stopping times Ti defined by 

{ti, ...,Tr}^{k:Xk^a} 

wc find P(G'a) — P^ ■ Moreover, for any set S C [l,m], by taking the 
r\S\ ordered stopping times r/^ defined by 

{ti, . . ■,Tr\S\} ^{k-.X^eS} 

we find 

\aG5 / 

It follows that the events G'^ are independent, and so 

m m 

EIg. >ElGi. = Bin(m,pO. □ 



d 

J-G' 

a=l a=l 
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Lemma 9. Let Xi,...,Xn be random variables taking values in 
{l,...,m, 00} such that a.s. each a G [1,"^] appears exactly r times. 
Denote Sk{a) := < k : Xi = a}. Let J^k = o'iXi, . . . ,Xk), and sup- 
pose moreover that for some c > and all a, k, on the event Skia) < r 
(which lies in J^k), we have 

P(Xfe+i = a\^k)>-. 

m 

Finally, let = i^{a : Sk{a) = r}. Then for every e > there are 
constants Ci,C2, depending on c,r but not on m or N , such that 

¥{D,,.^<{l-e)m) Ke-'^^. 

Proof. Let = YlT=i and note that > mr — em implies 

Dk>il-e)m. 

On the event Dk < (1 — e)m we have E(Tfc+i | J^k) — Tk > ce. 
Let Mk be cek — Tk, stopped when Dk exceeds (1 — e)m, then we 
see that Mk is a supermartingale with bounded increments. By the 
Azuma-Hoeffding inequahty for supermartingales (which follows from 
the martingale version by Doob decomposition; see e.g. [Azj or [W] 
E14.2 and 12.11]), for any Ci > there is a C2 so that P(Mcim > m) < 

If Mcjm < ^ and M is not yet stopped at time Cim, then 
Tcim > {cecim — l)m. If Ci is such that cecim—1 > r, this cannot hold, 
so M is stopped by time Cim with probability at least 1 — e"^^'". □ 

Corollary 10. Let Xi, Ai for i = 1, . . . ,N be two random sequences 
satisfying the assumptions of both LemmaslE andl^ Let G{a,i) be the 
intersection of the events Ga and {Si{a) = r}, i.e. 

N 

G{a,z) = {Si{a) = r} n f|({X,- ^ a} U Aj). 

Set Q{i) = ^a^G{ai)- There exist positive constants c\, 02, C3 (which 
depend on r, p, c, but not on m,N) such that ¥(Q{cim) > C2m) > 

If we again think about m counters, then the corollary means simply 
that after time Cim, with probability at least 1 — e~'^^"^, at least 
counters will have advanced r times. 

Proof of CorollarylTU Denote Q = J2a^Ga- Lemma [8] implies that 
Q stochastically dominates a binomial random variable. Thus, by a 
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standard large deviation estimate (see e.g. (Kj Chapter 27]), for some 
positive constants C4, C5 we have 

P(Q > c^m) > 1 - e-'^s"*. 

Take e = C4/2 in Lemma [9l It follows that for some ci with probability 
at least 1 — e~'^^"^ random variable Q{cim) differs from Q by not more 
than Cim/2. Thus, 

P(^g(cim) > C4m/2) > 1 - e-"«™. □ 

5. Proofs of the main results 

We are now ready to prove Theorems [1] and [2j We denote by T 
a standard staircase- shape Young tableau of size k and by T a uni- 
formly random standard staircase-shape Young tableau of size n. In 
what follows k and T are fixed while n tends to infinity. Given T, the 
idea is to consider cn specific disjointly supported subtableaux of T in 
columns [n/3j , . . . , [2n/3j and show that linearly many (in n) of them 
are identically ordered with T. 

Proof of Theorem [B Within the staircase Young diagram A of size n we 
fix m := [ri/ (3A; — 3)J disjoint subdiagrams Ki, . . . , Km of A in columns 
[n/3j, . . . , [2ri/3j, each a translation of a staircase Young diagram of 
size k. Let 9i be the translation mapping Ki to the staircase Young 
diagram. 

Let N := (2) and r := (2)- We now construct sequences Xt and At 
to which we shall apply Lemmas [H] and O as random variables on the 
probability space of standard staircase- shape Young tableaux T of size 
n with uniform measure. Set Xt = a if T~'^{N + 1 — t) belongs to Ka 
and set = 00 if T~^(A^-|- 1 — t) does not belong to IJ^ Ka- Note that 
each a G {1, . . . , m} appears exactly r times among Xi, . . . , X^. 

Next, we define the events At. If = 00 then At occurs. Otherwise, 
let a = Xt and suppose Xt is the ith occurrence of a among Xi, . . . ,Xt 
(or equivalently, N — t + 1 is the zth largest entry in Ka). If there is any 
s < t with Xs = a for which Ag does not occur, then At does occur. 
Finally, if the box T^^{N — t + is in the same position within Ka as 
T^^(r — z + 1) is within the staircase Young diagram of size k (in other 
words, if ea{T^\N - t + 1)) = f-\r - i + 1)), then At occurs. If it is 
not in the same position, then At does not occur. In other words. At 
fails to occur precisely if for some a, number t is the minimal number 
such that the locations of entries {N — t + 1, ... , N} imply that the 
subtableau supported by Ka and T are not identically ordered. 
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Rephrasing in terms of counters, we do the following. Recall that a 
uniformly random standard staircase- shape Young tableau T is associ- 
ated with a Markov chain of decreasing Young diagrams A*. Each step 
of this Markov chain is a removal of a box from a Young diagram. If 
the box X removed at step t belongs to K^, then we choose the hth. 
counter at this step. This counter advances if either the position of 
X is the correct one (as dictated by the order in T), or if the correct 
order of the entries of T inside was already broken before tth step. 
Clearly, if the /ith counter advances r times, then the subtableau of T 
with support Kh is identically ordered with T. 

Let us check that the sequences Xt and At, and the numbers r, m, 
A^, satisfy the conditions of Lemma [H] with 

As noted, every a G {!,..., m} appears among Xi, . . . , Xjy exactly r 
times. Thus, it remains to bound from below the conditional proba- 
bilities of At. Define J-j as in Lemma [8] and let W be an elementary 
event of J-f. We must prove that P(v4t | W) > p. If Xt = oo on W, 
then F{At \ W) = 1 > p. If some previous Ag with s < t and Xg = Xt 
did not occur (on W) then again F{At \ W) = 1 > p. 

In the remaining case, whether or not T belongs to At depends on 
the position of the box T~^{N — t + 1); specifically, T belongs to At if 
this box is the correct one according to T of the possible boxes in the 
subdiagram Kxf Let a denote the value of Xt on W. The Markov 
property of the sequence A* implies that 

F{At I W) = ^P(A*"^ =ij\W)- F{At I A*"^ = /x, W) 

(2) 

= J2 =I^\W)- ¥{At \Xt = a, A*-^ = /i), 

where the sum is taken over all Young diagrams n with |yu| = — t + 1 
boxes that are contained in the staircase Young diagram of size n. 

Let us bound P(A \ Xt = a, A*~^ = fi) from below. The condition 
Xt = a means that the box T~^{N — t + 1) is situated in the subdi- 
agram Ka- Thus, given Xt = a and A*~^ = fi, there are at most k 
possible positions for the box T^^(A^ + t — 1). Lemma [7] implies that 
the conditional probabilities of different positions differ at most by a 
factor of 2k'^ (since the parameter i in that lemma is at most k — 2). 
Consequently, the conditional probability of each position is at least 
l/{2k^). Exactly one of the positions corresponds to the event At. We 
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conclude that 



Hence, ([2]) gives 

Finally, let us check that the sequence Xt satisfies the conditions of 
Lemma [H Note that the condition St{a) < r means that the subdi- 
agram Ka is not filled by boxes T^'^{N — s + 1) with s < t. Thus, 
St{a) < r if and only if n Ka ^ 0. Note that the event {St{a) < r} 
belongs to J-'t- Let V be an elementary event from J^t such {St{a) < r} 
on V. Using the Markov property of the sequence A* we obtain: 

P(X,+i = a\V) = 5^P(A* = fi\V) P(X,+i = a\X' = fi,V) 

= 5^P(A* = ^i\V) P(Xt+i = a I A* = /i), 

where the sum is taken over the set of all Young diagrams /i with 
= N — t boxes that are subsets of the staircase Young diagram of 
size n. Since V C {St{a) < r}, we have P(A* = fi \ V) only for /i 
such that /i n Ka is non-empty. 

Consequently, in order to prove that 

P(Xt+i = a\V)>- 
m 

for some positive constant c, it suffices to show that 

c 



P(Xt+i = a I A* = ^) > 



for any Young diagram ^ contained in the staircase Young diagram of 
size n and such that H Ka is non-empty. Any such diagram /i has at 
least one corner inside Ka- Applying Lemma [6] for /i and this corner 
yields the required bound. 

Applying Corollary [TU] to the sequences Xt and At we get the state- 
ment of Theorem [21 □ 

We now deduce Theorem [1] using the Edelman-Greene bijection. 

Proposition 11. Fix any pattern 7 of size k. There exist constants 
C3, C4 and C5 (depending on'y) such that for every n> k, the pattern 7 
occurs at least c^n times within the time interval [1, 04,71] of a uniformly 
random sorting network of size n with probability at least 1 — e~'^-'^ . 
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Figure 8. "Padding" a tableau to get T. Here A; = 3. 



Note that Proposition [TT] differs from Theorem [T] in that we consider 
only the beginning of the network and hence only find a linear number 
of occurrences of 7. 

Proof of Proposition [Tl[ Clearly, it suffices to prove Proposition [TT] for 
patterns of the maximum length k{k — l)/2, in other words a sorting 
network of size k. Such a pattern 7 = (71, . . . ,7fc(fc-i)/2) corresponds 
via the Edelman- Greene bijection to some standard staircase- shape 
Young tableau of size k. Consider a larger standard staircase- shape 
Young tableau T of size k + 2, where entries of the hook of (1, 1) 
are the numbers 1, . . . ,2k + 1 (in an arbitrary admissible order) and 
the remaining staircase-shaped Young tableau of size k — 1 contains 
2k + 2, . . . , {k + 2){k + l)/2 and is identically ordered with T^. An 
example of this construction is shown in Figure [H 

Let C3, C4 and C5 be the constants c[, c'2 and C3 of Theorem [21 re- 
spectively. Let T be a standard staircase-shape Young tableau of size 
n having at least c^n disjointly supported subtableaux identically or- 
dered with T, furthermore, all the entries of these subtableaux are 
greater than — c^n. (Theorem [2] implies that a uniformly random 
standard staircase- shape Young tableau of size > /c is of this kind 
with probability at least 1 — e~'^^".) Suppose that the support of the 
ith such subtableau {i = 1,2,..., c^n) is a subdiagram Ki with top- 
left corner (n — ji — k,j^). Let K'^ denote the subdiagram with top-left 
corner {n — jf^ — k + l^ji + l) and note that the subtableau with support 
K[ is identically ordered with T^. 

Let u be the sorting network corresponding to T via the Edelman- 
Greene bijection. Note that in the Edelman-Greene bijection, every 
tableau entry moves towards the boundary of the staircase Young di- 
agram until it becomes the maximal entry in the tableau, and then it 
disappears and adds to the sorting network a swap in position j, where 
i is the column of the entry just before it disappeared. It follows that 



A PATTERN THEOREM FOR RANDOM SORTING NETWORKS 



19 



all the entries starting in Ki disappear in the columns ji, . . . ,j£ + k and, 
thus, add to the sorting network swaps Si satisfying ji < Si < jg + k. 
Furthermore, observe that all the entries starting in K'^ disappear (in 
columns Sj satisfying ji < Si < jg + k) before the entries in Kg \ K'^^. 
Finally, note that until all entries starting in K[ disappeared no other 
entry can disappear in columns j^, . . . ,je + k. 

We conclude that for every £, the pattern 7 occurs in u at [l,te] x 
[ji + l,ji + k — l]. Thus, pattern 7 occurs in u at least c^n times within 
the time interval [l,C4n]. □ 

Proof of TheoremUl Let C3, C4, C5 be the constants from Proposi- 
tion [TTl and let m := [04^]. For t = 1,..., [A^/mJ let It be the 
set of all sorting networks u of size n such that 7 occurs in u at least 
Can times within the time interval [{t — l)m + l,tm]. Proposition [TT] 
yields that P(/i) > 1 - e'"''''. 

A uniformly random sorting network (si, S2 • • • , Sjv) is stationary in 
the sense that (si, . . . , sn-i) and (s2, • • • , Siv) have the same distribu- 
tions (see [AHRVl Theorem 1]). Thus IP(/t) does not depend on t. 

There exist constants cq > and no such that if n > no, then 
[N/m\ e~'^^'^ < e"'^"". Let ci = min(^, ^) and C2 = min(c5,C6). Let 
/ denote the set of all sorting networks u of size n such that 7 occurs 
Cin^ times in u. If n > no then we have 

P(/) >^{P\I?) >l- ^(1 -P(/t)) > 1 - [N/m\ e-'^^" > 1 - e-'^^". 
t t 

And if < n < no, then /i C / and 

P(/) > P(/i) > 1 - e-"5" > 1 - e-"2". □ 

6. Uniform sorting networks are not geometrically 

realizable 

Proof of Theorem O Goodman and Pollack proved in the paper |GP] 
that there exists a sorting network 7 of size 5 that is not geometrically 
realizable. This sorting network is shown in Figure M (This is the 
smallest possible size of such a network.) 

Let us view 7 as a pattern. Suppose that 7 occurs in a sorting net- 
work u at time interval [l,t] and position [a,b]. We claim that w is 
not geometrically realizable. Indeed, if u were a geometrically realiz- 
able sorting networks associated with points xi, . . . ,Xn G (labeled 
from left to right), then 7 would be a geometrically realizable sorting 
network associated with the points Xa, . . . ,Xb- 
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Figure 9. A sorting network that is not geometrically realizable. 

Proposition [TT] yields that with tending to 1 probabihty 7 occurs 
within the time interval [1, €472] of a uniformly random sorting network 
u of size n and thus u is not geometrically realizable. □ 
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